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Abstract 

^ In this work, we redefined two important statistics CLRT (Bai et.aL, Ann. 

, — I Stat. 37 (2009) 3822-3840) and LW test(Ledoit and Wolf, Ann. Stat. 30 

W (2002) 1081-1102) on identity tests for high dimensional data using classic 

1^ sample covariances. Compared with existing CLRT, the proposed CLRT 

can accommodate data which have non-zero means and non-Gaussian dis- 
tributions and the new LW also can be applied to non-Gaussian data set. 
Simulations demonstrated that the tests had good properties in sizes and 
psj powers. What's more, we found CLRT is more sensitive to eigenvalues less 

^ than 1 while LW test has more advantages on detecting eigenvalues larger 

than 1. 
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^ 1. Introduction 

Suppose Ai,--- ,A„ are independent and identically distributed (i.i.d.) 
p-dimensional random vectors with population covariance matrix Sp and our 
C^ interest is to test 

Hq: T.p = Ipvs. Hi: Up ^ Ip (1.1) 

where Ip denotes the p— dimensional identity matrix. Note that the identity 






OO 



X 



matrix in (1.1) can be replaced by any other positive definite matrix Eq 



— 1/2 

through multiplying the data by Sq . 

In this work, we assume Hn = p/n — )■ y G (0, oo). For canonical statistical 
analysis, that is the sample size n tends to infinity while the dimension p 
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remains fixed, one can refer to [T]. Wlien y < 1, [3] proposed a correction 
to tlie classic likeliliood ration test (CLRT) and derived tlie central limit 
theorem(CLT) using random matrix tlieory(RMT). When y > 1, CLRT is 
degenerate since the sample covariance is no longer invertible with probabil- 
ity one and [TU] gave a new statistics (LW test) which could accommodate 
situations for any y > 0. We note that LW test has drawn much attention in 
literature which include [16] which considered the test for part of the eigen- 
values of Sp are equal and [7] which extended LW test to cases y = and 
oo. There have been also a substantial set of research works motivated by 
LW test such as P [IE] and [11] . 

However, most of these results included CLRT and LW test were derived 
under Gaussian assumptions or equivalent conditions such as fourth moment 
equals three. The difficulty to relax Gaussian assumptions is due to the cen- 
tral limit theorems for the linear spectral statistics defined by eigenvalues. 
More details can be found in [5] which built the CLT for linear spectral statis- 
tics of large-dimensional sample covariance matrices under the assumptions 
on fourth moments and [15] improved the results for general finite fourth 
moments. The sample covariance matrix defined in [5l |T5] is a simplified 
version of classic sample covariance matrices and the data must had zero 
mean. Recently, [13] derived the CLT for linear spectral statistics of classic 
large- dimensional sample covariance matrices. Some other important results 
include [HI [12] and so on. 

Besides Gaussian assumptions, CLRT in [3] actually is only applicable 
for data with zero-means which is too strict to applications. In this work, we 
redefined CLRT and LW test using classic sample covariance matrices and 
derived the CLTs of the two tests in general conditions which can accom- 
modate data which have non-zero means and non-Gaussian distributions. 
Simulations demonstrate that the proposed tests have good properties in 
sizes and powers. What's more, compared with LW test, CLRT has its own 
advantages on detecting the eigenvalues of Sp near zero. That is CLRT is 
more sensitive to eigenvalues less than 1 while LW test has more advantages 
on detecting eigenvalues larger than 1. In literature, there is also some work 
not based on sample covariance matrices such as [8] which proposed a test by 
constructing estimators from the data directly. We also conduct simulations 
to compare our proposed tests with the one in [8]. 

The paper is structured as follows. Section 2 introduces the basic data 
structure and establishes the asymptotic normality of the CLRT and LW 
test. Section 3 reports simulation studies. All the technical details include 



proofs and preliminary results in RMT are deferred to the Appendix. 

2. Main Results 

We assume the observations Xi, ■ ■ ■ ,Xn satisfies a multivariate model 

(i) 

X, = Sy2y^ + /i, fort = l,--- ,n (2.2) 

where /i is a p-dimensional constant vector and the entries of 3^^ = 0^1 j)pxn = 
(Fi, ■ ■ ■ ,Yn) are i.i.d. with EYij = 0, EFj = 1 and EF^ = 3 + A. Here we 
introduces two versions of the sample covariance matrices. The classic one 
is defined as 

1 " 

Sn = 7 / (Xk — X){Xk — X) , 

fc=i 
where X = - Y12=i -^k and a simplified version has the form 

1 " 
n ^-^ 

k=l 

We refer to [2] and [H] for the differences between Sn and Bn in RMT. Then 
we can introduce CLRT in [3] 

L„ = -tr{Bn) - - log \Bn\ - 1 (2.3) 

p p 

where tr denotes the trace. 

When Xi ~ Np{0,lp) (or Xi ~ Np{fi,Ip) where fj, is known), ^ derived 
the CLT of CLRT 

p{Ln - (1 + (l/2/„ - 1) log (1 - Vn))) 4 N(- log (1 - y)/2, -2y - 2 log (1 - y)). 

where — )■ denotes convergence in distribution and N the normal distribution. 
When p is larger than sample size n, since the sample covariance matrix 
Sn is singular, [10] proposed LW test which is defined as 

Wn = -tr{Sn - I,f - -^{-triSn)f (2.4) 

p n — \ p 
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If Xj ~ Np{^, Ip) and under some other assumptions, [lU] has proven 

nW^„4N(l,4). 
In our work, we will redefine the CLRT as 



and LW test as 
1 



Wr,. 



P 



'^1"{^n -'p 



P 



P 



-triSn) 



los.\S„ 



P 



n — 1 p 



tr(5„))' 



'l + A){n-2)(n- 1) - 2 
n(n — 1)2 



(2.5) 



(2.6) 



• (2.7) 



In (2.6) and (2.7), noting that the statistics are defined by Sn which is in- 



variant under the shift transformation Xj = Xj + c, we can assume /x = in 



(2.2) without loss of generality. By [5], we know almost surely 



P 



\og\Sn\ - -log|Sp| "4 -1 - il/y-l)\og{l-y)) = d{y). 



P 



which means Ln + d{y) is a estimator of (tr(Sp) — log |Sp| —p)/p. Similarly, 
Wn is a estimator of tr(Sp — IpY /p. Noting that 

Ep = Jp ^ (tr(Sp) - log |Sp| - p)/p = ^ tr(Sp - Ipf/p = 0, 



therefore, L^ and Wn can act as the statistics to test ( 1.1 ) theoretically. Next, 



we will establish the asymptotic normalities of L„ and Wn- 
Theorem 2.1. When Sp = Ip and p/y — )■ y G (0, 1), 

p{K - (1 + (l/Z/n - 1) log (1 - yn))) ^ N(m, V) 

where m = y{A/2 — 1) — 31og(l — y)/2 and v = —2y — 2 log {1 — y). 
Compared with [3] which is only applicable to Gaussian data with known 



means. Theorem 2.1| was built under general conditions and can be applied 
to Gaussian data directly. If the population mean is unknown, the CLRT in 
[3] will behaves poorly and the new one are still applicable. Simulations will 
be conducted in subsection 3.1. 



Theorem 2.2. Under Hq and p/n — t- y G (0, oo), 

Due to the test in [H] and LW test are both based on the estimators of 
tr(Ep — IpY /p, foUowing are remarks about these two tests and simulations 
wiU be shown in next section. 

Remark 2.1. Direct calculations can show that when H = Ip, Wn is a un- 
biased estimator of trijlp — LpY . In l^, authors also proposed a unbiased 
estimator Vn oftriTjp — IpY . What's more, under the conditions of Theorem 



2 



Var{W„) -^ %^ and VariVn) -)■ % 

However, for Gaussian data, due to Wn is based on Sn which is a complete 
sufficient statistic for Sp, we can claim Wn is the unique best unbiased esti- 
mator of trijlp— I pY andVariWn) < VariVn) by Lehmann-Scheffe theorem. 
From simulations, we know Ay^ < Var{Wn) < VariVn) — >■ 4?/^ and therefore 
the sizes of our new LW test will behaves better than the test in f^. For 
non- Gaussian data such as Gamma and Uniform variables, simulations in 
subsection 3.2 can show that LW test performs favorably in comparison to 
existing methods. 

Remark 2.2. In ^8], for comparison purposes, authors assumed A = even 
for gamma random vectors and here from Theorem \2.^ we know it is not 



reasonable. From Theorem 2.2, when Sp = Ip and Yij ~ Gamma[4:,0.5], by 
Slutsky's theorem we know 

{nWn-l)/2^N{0.75,l). 

However, in f^, authors still thought {nWn — l)/2 — )■ N(0, 1). Moreover, since 
P{Z > $-1(0.95)) = 0.185 where Z ~ N(0.75, 1) and $ is the distribution 
function of standard norm variables, this explain why the sizes of LW test in 
l^ are near 0.185 not 5%. 



In Theorem 2.2, when Xj ~ A^(/i, Ip) that is A = 0, we can get 



p(;|,;_ ("-2)("-l)-2 )^M(0.V 

n[n — ly 



which is in accordance with (2.5) by Slutsky's theorem. By Theorem 2.1 and 



2.2, the results depend on A. When EYij ^ 0, Theorem |2 . 1 1 and |2 . 2 are still 



applicable under new assumptions E(Yij — EYij) = 1 and E(Yij — EYij) = 
3 + Aby ^. 

3. Simulations 

We report results from simulation studies which were designed to evaluate 
the performance of the proposed identity tests for the covariance matrix. For 
comparison purposes, We also conducted the test proposed by [8](CZZ Test). 
Here, the ratio y could be estimated by ?/„ = p/n and three scenarios were 
considered with respect to the innovation random matrix (l^j)pxn 

(I) Yij i.i.d. ~ Nil, 1) where A = 0; 

(II) Yij i.i.d. ~ Gamma [4, 0.5] where A = 1.5; 

(III) r,j i.i.d. ~ f/m/orm[0,2\/3] where A = -1.2. 

To evaluate the power of the tests, two different population covariance ma- 
trices were considered in the simulations. We set S}, = diag{1.5 I[o,2p], Ip-[o.2p]] 

or Ep = diag{0.5 /[o.2p], Ip-[o.2p]), where [x] denoted the integer truncation of 
X. It is that the diagonal covariance Sp had respectively 20% of its diagonal 
elements being 1.5 or 0.5 whereas the rest were 1. 

3.1. CLRT Tests 

Firstly, we will study our redefined CLRT and existing CLRT. Since the 
existing CLRT in ^3j only can deal with Gaussian variables with known mean, 
in our simulations, for each [p, n), we run 1,000 independent replications with 
real Gaussian variables with means zero. Table 1 reported empirical size 
and power of our redefined CLRT and existing CLRT for Gaussian variables 
Yij ~ N(/i, 1) (/i is known and without loss of generality, we assume /i = 
here). The nominal test level is set to be 5%. 

From Table 1, we know even for Gaussian variables with known mean, 
our CLRT is comparable to one in [3]. As p and n both are increased, the 
size or power of the two tests are quite close to 5% or 1 and make no much 
difference. What's more, for the same sample size n, when y gets smaller, 
the sizes are more close to 5% and the powers are more close to 1. A possible 
explanation is that if we only have n samples, when p gets smaller (that is 
y gets smaller), our redefined CLRT or existing CLRT, as the estimator of 
(tr(Sp) — log |Sp| — p)/p, will become more accurate. 



Table 1: Performances of redefined CLRT and existing CLRT 
Redefined CLRT Existing CLRT 



n 


y = 0.25 


y = 0.5 


y = 0.75 


y = 0.25 


y = 0.5 


y = 0.75 






Sp 


= /p, F.,~N(0,1) 






40 


0.077 


0.072 


0.076 


0.071 


0.061 


0.062 


80 


0.062 


0.061 


0.062 


0.060 


0.056 


0.055 


160 


0.054 


0.053 


0.054 
-y(i) Y-. 


0.053 

~N(0,1) 


0.052 


0.053 


40 


0.220 


0.182 


0.177 


0.214 


0.168 


0.162 


80 


0.397 


0.342 


0.281 


0.397 


0.337 


0.275 


160 


0.819 


0.769 


0.632 


0.816 


0.762 


0.625 


200 


0.926 


0.889 


0.815 
-y(2) y.. 


0.925 

~N(0,1) 


0.895 


0.816 


40 


0.371 


0.369 


0.272 


0.370 


0.367 


0.278 


80 


0.841 


0.749 


0.618 


0.840 


0.762 


0.641 


160 


1 


1 


0.986 


1 


0.999 


0.990 






Sp = 


h. y^^- 


N(l/4,1) 






100 


0.066 


0.051 


0.059 


0.689 


0.837 


0.834 



If the true mean /x is not zero and we still use the CLRT in [3] , the results 
of the last part in Table 1 is the experiment for Yij ~ N(l/4, 1). It can be 
found that the existing CLRT behaves very poorly and our CLRT is still 
applicable. 

3.2. LWtest VS CZZ test 



From Theorem 2.2, we know the CLT of LW test depends on A and it is 
not reasonable that in [8\ authors assumed A = even for gamma random 
vectors. Here we will repeat part of the simulations in [8] and the nominal 
significant level of the tests was 5%. Simulation results are reported in Table 
2 where the performances for Gaussian data are based on 10^ simulations 
and for non-Gaussian data results are based on 10^ simulations. 

Obviously, the proposed LW test is not so bad as shown in Table 1 of 
[8]. We observe from Table 2 that compared with CZZ test, LW test has a 



better empirical size for Gaussian data which were assuring in Remark |2.1 
For Gamma or Uniform variables, the sizes of LW test and CZZ test are 
comparable and when p or n increases, the sizes both tend to the nominal 
5% level. It is noted that Table 1 in [8] was the results for sphericity test, not 



Table 2: Performances of LW test and CZZ test 

LW test CZZ test 

p n = 20 n = AO y = 60 n = 80 n = 20 ra = 40 n = 60 n = 80 

Gaussian random vectors 



38 


0.0659 


0.0569 


0.0601 


0.0551 


0.0730 


0.0573 


0.0637 


0.0576 


55 


0.0693 


0.0611 


0.0593 


0.0537 


0.0732 


0.0616 


0.0600 


0.0532 


89 


0.0650 


0.0558 


0.0603 


0.0531 


0.0692 


0.0585 


0.0620 


0.0550 


159 


0.0635 


0.0600 


0.0608 


0.0564 


0.0673 


0.0609 


0.0598 


0.0569 








Gamma random vectors 








38 


0.096 


0.079 


0.069 


0.065 


0.078 


0.066 


0.064 


0.058 


55 


0.092 


0.089 


0.085 


0.087 


0.069 


0.076 


0.070 


0.089 


89 


0.101 


0.089 


0.070 


0.068 


0.086 


0.075 


0.053 


0.067 


159 


0.086 


0.065 


0.061 


0.062 


0.075 


0.051 


0.055 


0.054 








Uniform random vectors 






38 


0.051 


0.044 


0.046 


0.051 


0.059 


0.050 


0.049 


0.056 


55 


0.064 


0.058 


0.056 


0.043 


0.073 


0.064 


0.059 


0.044 


89 


0.057 


0.049 


0.056 


0.043 


0.069 


0.056 


0.059 


0.047 


159 


0.050 


0.051 


0.056 


0.056 


0.061 


0.056 


0.059 


0.062 



identity test. Since authors claimed the simulation results for the identity 
test followed very much similar patterns to those of the sphericity test, here 
for comparison purposes, we still use Table 1 in [8] for identity test. 

Above all, the performances of the proposed LW test are comparable to 
CZZ test and for Gaussian data, the new LW test even has a better size. 
Compared with CZZ test which does not depend on A, LW test has its own 
advantages on computational cost. Actually, in order to compute LW test, 
we only need the sample covariance matrix Sn while CZZ test is consist of 
five parts. 

3.3. Powers of LW and CLRT tests 

Results were reported in Figure 1, corresponding to innovation random 
matrices (I), (II) and (III) and Sp = Ep or Ep = Ep . The power results 
in Figure 1 showed CLRT and LW tests both approached to 1 when n was 
increased. For Ep , that is part of the eigenvalues of Ep were larger than 1, 

(2) 

the power of CLRT was a little worse than the one of LW test and for Ep , 
when part of eigenvalues were less than 1, CLRT behaved better than LW 
test. The reason is due to the differences between tr(Ep) — log |Ep| — p and 
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Sample Size n 
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Figure 1: Realized powers of LW test and CLRT for different sample size n and p/n = 0.5. 
The top three are for Sp = Sp and the bottom three are for Sp = Sp . 



tr{llp — IpY which offer avenues for further research and we would not pursue 
it here due to a hmited space. 

4. Conclusions 

In this work, we redefined two identity tests CLRT and LW test for high 
dimensional data. Compared with existing CLRT, the new CLRT and LW 
test can accommodate data which have non-zero means and non-Gaussian 
distributions. Simulations demonstrated that the tests had good properties 
in sizes and powers. What's more, from the simulations, we found CLRT is 
not the best one even for Gaussian variables although CLRT came from the 
likelihood functions of normal distributions. Also, the powers of the tests 
(included CZZ test) depend on the population covariance matrix and our 
next work will focus on performances of LW test (or CZZ test) and CLRT 
theoretically. 
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5. Appendix 

5.1. Preliminary results in RMT 

Suppose An is an n X n Hermitian matrix with eigenvalues Ai, ■ ■ ■ , A^. 
Define the empirical spectral distribution (ESD) of A„ as 



1 " 
F^-ix) = -Yl{\<x). 

n ^-^ 

1=1 

The limit distribution of F^^ is called the limiting spectral distribution (LSD) 
of the sequence {A„}. 

And the Stieltjes transform of F"^" is given by 

m^^"(z) = / -^dF^-{x) = -tr(A„ - z/„)"\ 
J X — z n 

where z = ^ + iv eC^ . By the inverse formula, 

F^"{[a,6]}= lim - / Im{m^^'' {x + iv))dx. (5.8) 

Here we need another sample covariance matrix which is defined as 



6. 



1 .", _ 1 

- V(x, - x){x, - xy = - — Sn. (5.9) 



n ■' — ' n 

fc=i 



If the spectral norm of Sp is bounded by a positive constant and F^^ con- 
verges weakly to a non-random distribution if as p — )■ oo, by [17J or jT3], 
with probability 1, F®" and F^" tend to the same probability distribution 
Py,Hi whose Stieltjes transform m = m{z) {z G C^) satisfies 

m= ^ dH(t). (5.10) 

J t{l-y-yzm)-z ^ ^ ^ ^ 

Denoting Gn{x) = piF"^" (x) — Fy^^Hn{x)) , for any analytic function /, J f{x)dG„ 
converges weakly to a Gaussian variable Xf under some assumptions on Sp 
bym. 

When Sp = Ip, Fy^u is standard MP law Fy whose density function is 

9y{^) = ^J{{l + V^?-x){x-{l-^Y)), {l-^f<x<{l + ^f 

11 



.X 



and from (5.10), we know 



1 



m 



1 — y — yzm — z 
Writing m = ym ^, by [14j, we have 

ymfi-^ + j^) A rymfi-^ + jf^] 



EX f 



1 

2TTi 

y 

2TTi 7 (1 + rn){yrn — 1 — m) 



'1 + m)((l + m)2 — cm?) 
f(-- + -^^) 



dm 



2TTi 



I +m) 



dm 



dm. 



(5.11) 



and 



Var{Xf) 



1 

y^ 
'47r2 




f{zi)f{z2 



/(^) .^..^^2 



1 



'1 + m(z))' 



{m{zi) -m{z2)) 
dm{z)Y. 



-dm{zi)dm{z2] 



(5.12) 



The contours in (5.11) and (5.12) are both contained in the analytic region 



for the function / and both enclose the support of Fy^Hni^) for large n. 
Moreover, the contours in (5.12) are disjoint. 



5.2. Proofs of Theorem 2.1 



Writing f{x) = x — log(a;) — 1, we have 



f{x)dF^"{x) 



and 



p{Ln- / f{x)dFy^{x)) 



p{J f{x)dF^-{x)- J f{x)dFySx))+p{j f{x)dF'"{x)- J f{x)dF^"{x)) 
p{ [ f{x)dF^-{x) - [ f{x)dFy„{x)) + {tr{Sn)-log\Sn\-tr{Gn)+\og\Gn\) 



p{ / f{x)dF^-{x) - / f{x)dFy„{x)) + 



n — 1 



tr{G„) -plog( 



n 



n — 1' 



p{ / f{x)dF^-{x) - / f{x)dFy^{x)) + o{l). 
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From [3], p{f f{x)dF'^"{x) — f f{x)dFy^{x)) converges weakly to the Gaus- 
sian variable Xj. Next we calculate the mean and variance of Xf. The 
following results have been given in Bai et.al.(2008) 

f{x)dFy^{x) = 1 - — log(l - yn), 



ymfi-^ + jf^) 1 

*" *" dm = —- \og{l — y) , (5.13) 



27ri J (l + m)((l + m)2-cm2) — 2 

and 

1 f f f{z,)f{z2) 



27r^ 7 J {m{zi) — m{z2)y 
Also, from [15], we know 



dm(zi)dm{z2) = -2y - 2 log (l-y). (5.14) 



2'Ki J m 1 + m (1 + m)'^ 

Therefore, to get EXg, we still need to calculate 

<h log( 1 )- — .n dm 

2'Ki J m 1 + m (1 + my^ 

= -^/log(-l + ^)^(-^+ , ' ,J 
27ri J ^^ m 1 + m^ ^ 1 + m 2(l + m)2^ 

J^ y 

y 1 m? (l+m)2 ,1 1 , , 

'*" ~ ^ —' ' -)dm 



2m J -^ + ^'l + m 2(l + m)2' 

'^ ml -km. ^ / 



m 1+772 



y if 1 + m ym 1 1 



2m J m{ym ~ m — 1) (m + l)(ym — m — 1) 1+m 2(l + m)2 

y L ^ 1 



2TTi J m{ym — m — 1) 2m(m + l)(ym — m — 1) 
1/ 



)(im 



(5.15) 
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and 



f(-- + -^ 



27Ti 7 (1 + rn){yrn — 1 — m) 



dm 



y 

27ri 



m l+m 






[1 + m) {ym — 1 — m) 



dm 



1/",,1 y .. 1 1 ,, 

-y + TT^ log + ^ ^ —)dm 

zm I m, 1 + ml+m m, 



y-i 



= -l/-log(l-l/) 
where we used the following results 

1 



(5.16) 



[m + l)''{ym — m — 1 
and one equality in [S] 



-dm, = 0, k = 1,2, 3, 



TTZ 



log I 



. _)(^ 1 



m l + m l + m m 



-^)rfm = -2\og{l-y). 



y-i 



By ( [5l3| ), ( [5l5| and ( [5l6| ), EXj = |/(A/2 - 1) - 31og(l - y)/2. 
Through a routine calculation, we have 



2Txi J m l + m (1+m)^ 



(im = 



(5.17) 



By (5.14) and (5.17), we have Var{Xf) = -2y - 2\og{l - y). 



The proof of Theorem 2.1 is complete. 



5.3. Proofs of Theorem \2.2 



Noticing that p(-tr(©„) — 1) satisfies CLT and -tr(©„) — )■ \,a.s., we 
have 

p{{Ur{&n)f - 2-tr(6„) + 1) = p([-tr(6„)] - 1)^ = o(l). 



P 



P 



P 
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By (5.9), we have 



p[Wn 



-triGn - Ipf - -tr{&n) + -)) 
n n 



P 



Ti 1 

tr{ ,©„- 1^)2 _tr(6„- 1,^2 ■ ' 



n 



'n — 1 



'py 



n {n — 1)^ 



)(tr(6„))2 + o(l) 



n" ,x . ,^2^ ~, IT' .X . ._, X ,1 



-y' + o{i] 



-l)tr{ei)-2( 



n — 1 



l)tr{Gn) + 



n 



n {n — 1)' 



:){tr{&n)f 



and 



p{-tr{&,, - I^f - -tr{&,^) + ^) - /((x - If - 2yx + y)rfp(F®"(x) - F,„(x)) 
p n n J 

= {y- yn){2tr{&n)-p)+p j{{x - if - 2yx + y)dFySx) 

= 2{y - yn){tr{Gn) - p) 

= 0(1) 

Writing 5f(x) = {x—lf—2yx+y, by [Hj, J g{x)dp{F'^"{x)—Fy^{x)) converges 
weakly to gaussian variable Xg and through a routine calculation 

EXg = y + yA + y''=y{l + A + y). 

To get the variance Var\Xg), we need 

g{z{mi)) 



{iTil — 7712)2 



(imi 



:[( + 



y ,\2 , o../ 1 , 1/ 



(7711—7712)2 TTii 1 + 7?7i 

Airiy'^ Aniy'^ 47riy'^m2 



lf + 2yi + 



7?7i 1 + 7?7i 



+ l)]dmi 



(7772 + 1)3 (7772 + 1)^ ("^2 + 1) 



2y' 



TTl 



g{z{m)) 



m 



(m + 1)3 



dm = Ay"^ 



and 



1. [ g( ^ \ y ) ^ 

27X1 J 771 1+771 (1+7/7)2 



dm = 



15 



Then 

Var{Xg) = %2 
Above all, we can get 



By Slutsky's theorem, the proof of Theorem 2.2 is complete. 
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